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Abstract 

We consider goodness-of-fit tests of symmetric stable distributions based on weighted integrals of 
the squared distance between the empirical characteristic function of the standardized data and the 
characteristic function of the standard symmetric stable distribution with the characteristic exponent 
a estimated from the data. We treat a as an unknown parameter, but for theoretical simplicity we 
also consider the case that a is fixed. For estimation of parameters and the standardization of data 
we use maximum likelihood estimator (MLE) and an equivariant integrated squared error estimator 
(EISE) which minimizes the weighted integral. We derive the asymptotic covariance function of the 
characteristic function process with parameters estimated by MLE and EISE. For the case of MLE, 
the eigenvalues of the covariance function are numerically evaluated and asymptotic distribution of the 
test statistic is obtained using complex integration. Simulation studies show that the asymptotic dis- 
tribution of the test statistics is very accurate. We also present a formula of the asymptotic covariance 
function of the characteristic function process with parameters estimated by an efficient estimator for 
general distributions. 

1 Introduction. 

The family of stable distributions is one of the most important classes of distributions in probability 
theory. The general central limit theorem asserts that if a suitably normalized sum of independently and 
identically distributed (i.i.d.) random variables has a limit distribution, only possible limits are the stable 
distributions (Chapter 6 of Feller (1971)). Concerning statistical inference, because of their attractive 
properties such as heavy tails, many models based on stable distributions have been considered in both 
social and natural sciences (Samorodnitsky and Taqqu (1994), Uchaikin and Zolotarev (1999), Rachev and 
Mittnik (2000)). Therefore it is important to consider goodness-of-fit tests of stable distributions. However 
few researches on goodness-of-fit tests of stable distributions have been conducted due to the difficulty in 
expressing their density functions explicitly. The purpose of this paper is to propose goodness-of-fit tests 
based on the empirical characteristic function, since the characteristic functions of stable distributions are 
explicitly given. For past researches on goodness-of-fit tests of heavy-tailed distributions using empirical 
characteristic function approach, see Giirtler and Henze (2000) and Matsui and Takemura (2005). Both 
papers treat Cauchy (a = 1) distribution which is one of the stable distributions. 

Let f{x; fj,,a,a) denote the symmetric stable density with the characteristic function 

$(t) = ex.p{ifit - \at\"), 
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where the parameter space is 



^^ = {-oo < |Lt < oo, cr > 0, < a < 2}. 

Here a is the characteristic exponent, jjl is the location parameter and a is the scale parameter. For the 
standard case (/x, cr) = (0, 1) we simply write the characteristic function as $(t;a) = exp(— |f|") and the 
density function as f{x;a). In this parameterization stable distributions form a location-scale family for 
each value of a, i.e., 

fix; IX, a, a) = -f{- — -■,a). 

a a 

In order to cope with more general situation or for notational convenience we also write the parameters 
as 

e= {61,62,6:,) = {pi,a,a) 
and write corresponding density, distribution or characteristic function as 

f{x;n,a,a) = f{x;e), F{x; iJ,,a,a) = F{x;0), /x, cr, a) = $(a;; 6*). 

Here we note that ^ is a vector. 

In this paper we often differentiate functions of the parameter 6 and the data x with respect to x, fj,, a 
and a. Since we will consider affine invariant (location-scale invariant) tests, it is often sufficient to 
evaluate the derivatives at the standard case (//, a) = (0, 1). For example we use the notation 

d d 
ffi{x;a) = — /(a;;/x,(7,Q!)|(^,(^)=(o,i) or f'{x;a) = — /(a:;;/x,(7,Q;)|(^,c^)=(o,l)■ 



Concerning the characteristic function we also use Vo^{t;9) = {^ij,{t;d),^a{t',d),^a{t',6)) where, for 
example, 

d_ 
djj, 

For standard case [ji, a) = (0, 1) we write 



d 

^fi{t;oi) = —$(*;//, £7, a)|(^,^)=(o,i). 

Given a random sample xi, . . . ,Xn from an unknown distribution F, we want to test the null hypothesis 
Hi that F belongs to the family of stable distributions f{x;n,a,a) and the null hypothesis H2 that F 
belongs to the family of stable distributions f{x; fi, a, a) with a = aQ fixed. Note that Hi D H2- Here we 
explain our proposed procedure for testing Hi, because for H2 we can simply replace a by ao- 

As remarked above stable distributions form a location scale family and we consider affine invariant 
tests. The proposed tests are based on the difference between the empirical characteristic function 

(1.1) $„(t) = $„(i;/i,<T) = -^exp(%j), yj = . ^ , 

of the standardized data yj and the characteristic function with a estimated from the data 

$(t) = $(t;a) = e-l*l". 
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Here fi = ftn = fln{xi, . . . , Xn), a = (Jn = an{xi, . . . , Xn) and a = q„ = a„(xi, . . . , x„) are affine 
equivariant estimators of /i, a, a satisfying 

/lnia + bxi,...,a + bxn) = a + 6/i„(xi, . . . , 
ania + bxi,...,a + bxn) = ban{xi, . . . ,Xn), 
ania + bxi,...,a + bxn) = an{xi, ■ ■ ■ ,Xn), 

for all — oo < a < cxD and 6 > 0. 

As equivariant estimators we consider maximum likelihood estimator (MLE) and an equivariant in- 
tegrated squared error estimator (EISE) defined in (|2.6)) below. The reason for considering MLE is its 
asymptotic efficiency and the reason for EISE is that its definition is similar to our proposed test statistic. 

Following Giirtler and Henze (2000) and Matsui and Takemura (2005) we propose the following test 
statistic 

/oo 
l^nit) - e^\'\'^fw{t)dt, w{t) = e-"l*l, K > 0. 
-oo 

Dn^K is the weighted L^-distance between ^n{t) and the characteristic function e"'*' of f{x;a) with 
respect to the weight function w{t) = e"*^'*', k > 0. This weight function is chosen for convenience, so 
that we can evaluate the asymptotic covariance function of the empirical characteristic function process 
under Hi. 

The test statistic Dn.K has an alternative representation, which is useful for obtaining its asymptotic 
distribution. 

(1.3) Dn,K = / \Zn{t)\ anw{ant)dt, 

J — oo 

where 

1 

(1.4) Znit) = -j= ^ |cos(tx-,) + zsin(tej) - e"''^"*'"" (cos(t/i„) + i sin(t/i„)) | . 

Znit) corresponds to the empirical characteristic function process. 

Our test statistic -Dri,K is a quadratic form of the empirical characteristic function process. Although 
we derive an explicit form of the asymptotic covariance function of the empirical characteristic function 
process, it is not trivial to derive the asymptotic distribution of Dn.K under H\ and H2 from the covariance 
function, especially when the parameters are estimated. See chapter 7 of Durbin (1973a) and Durbin 
(1973b) for tests based on empirical distribution functions with estimated parameters and see Giirtler 
and Henze (2000) and Matsui and Takemura (2005) for tests based on empirical characteristic functions. 
As in Matsui and Takemura (2005) we evaluate the asymptotic distribution of Dn./t for the case MLE 
by numerically approximating the eigenvalues of the asymptotic covariance function. By numerically 
evaluating the asymptotic distribution we can also check the convergence of the finite sample distributions 
which we obtain by Monte Carlo simulations. 

Concerning EISE, as shown below, the asymptotic covariance function of the empirical characteristic 
function process is very complicated. Furthermore we found that Monte Carlo simulation involving EISE 
is very time consuming. Therefore in this paper we show theoretical results on our proposed test statistic 
involving EISE and leave numerical studies to our subsequent works. 
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This paper is organized as follows. In Section [2.11 we first define and summarize properties of MLE 
and EISE. Then in Section [2.21 we state th.eoretica.1 results on asymptotic distribution of Dyi ,^^ under Hi 
in Theorem 12.31 and Theorem 12.51 and results under H2 as the corollaries of these theorems. Numerical 
evaluations of asymptotic critical values of -Dn,K under Hi and H2 for MLE are discussed in Section IHl 
Simulation studies of MLE and corresponding test statistic are given in Section 4, including the 

study of finite sample power behavior in Section 14.31 



2 Main results 

2.1 Estimators and their asymptotic properties 

For our purposes we need asymptotic covariance matrices and "asymptotically linear representations" (AL 
representations) of the estimators. We describe asymptotic properties of maximum likelihood estimator 
(MLE) following DuMouchel (1973). We also define an equivariant integrated squared error estimator 
(EISE) and give asymptotic properties of EISE. For MLE explicit expressions of the asymptotic covariance 
matrix and AL representations are given in the Cauchy case a = 1. 

As shown in DuMouchel (1973), MLE is asymptotically normal and asymptotically efficient. The 
likelihood equation is given by 



00 



(2.1) ?^ = ^ y J—, / e ''^^)<^Jt;a)dt = 0, 



00 



Or -I poo . / xj 



a: 



00 



($^(t; a), ^^{t; a), $a(t; a)) = fite"!*!", -e"'*'" \t\'^a, -e"'*'" \t\'^ log |t| 



where 



EISE is an affine equivariant version of the ISE (integrated squared error) estimator proposed by 
Paulson et al. (1975). The original ISE estimator of Paulson et al. (1975) is not equivariant. Robustness 
and efficiency of ISE estimators of location and scale parameters are discussed in Thornton and Paulson 
(1977) for the normal case and in Besbeas and Morgan (2001) for the Cauchy case. EISE is based on the 
standardized empirical characteristic function. Let 

1 x - / . Xj — jJL 



^n{t; /U, cj) = - ^ exp (it 



n — ' \ cj 



which is the same as (|1.1|) with //„ and (T„ replaced by /i and o. Write 

/oo 2 
|$„(t;/x,(T) -e-l*l°| w{t)dt, 
-00 

where we use the following weight function 

(2.5) w{t) = exp{-u\tf), u>0. 
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Here we call a weighting index and V weighting constant. EISE (/im (7„, a„) is defined to be the minimizer 
of cr, a): 

(2.6) Q(An, o-n, On) = min Q[[i, a, a). 

It is easy to see that EISE is affine equivariant by definition. Note that the weighting constant k in the 
test statistic (|1.'2() and the weighting constant u in ()2.5|) for EISE may be different. In our theoretical 
results on EISE we can treat more general weighting functions, i.e., w{t) > is an arbitrary even function. 
However for performing goodness-of-fit tests, it seems natural to set aQ = a and v = k. The integral Q{9) 
can be calculated as 

j — ^cos {t{xj -Xfc)/cr) ^cos {t{xj -/i)/cr)e"l*l° + e"^'*'" \w{t)dt. 

-~ ^ j,k ^ j=i J 

The estimators satisfy the following estimating equations = dQ/d^ = dQ/da = dQ/da 44> = Qf_i{9) = 

Qa{0)=Qa(.O). 

(2.7) Q^ie) = - r !.-j2^m{tixj-fi)/a)te-\'\"\wit)dt, 
x{t(;'(cj|t|)cr|t| +w{at)} + 2|;i ^cos {t{xj - fi)) - e"''^*'" |a|(Tt|°e"l'^*l°u;(cjt) 



(2.8) Q^{e) 



dt, 



(2.9) QaiO) 



r | - ^cos {t{xj - fi)/a) - e-l*l"|e-l*l"|tr log \t\wit)dt, 



where w'{x) = dw{x) / dx. Note that in case of Qa{S) differentiation was done after the transformation 
t at. 

In the rest of this paper we use the following notations. — > means weak convergence of random 

P 

variables or stochastic processes, — > means convergence in probability. 

The asymptotically linear representations (AL representation) give an method of approximating 
asymptotic behavior of the estimator by sum of functions of i.i.d. random samples. For the standard 
symmetric stable case f{x; a) we need following three expressions, 



1 " 

^/nfln = —^y^h{Xj)+rin, 

1 " 

1 " 

^/n{an-a) = —=S^h{Xj) + r^n, rin,r2n, rsr, 



For the case of MLE, AL representations are given in terms of the score functions ( ()2. 112. 3(1 ) and the 
Fisher information matrix. The proof is standard and omitted. 
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Theorem 2.1 Let I{9) denote the Fisher information matrix 



m 



hi 
I22 hs 
I32 hi 



hj{6) — —Eg 



(9^ log/(x,( 



The AL representations le{x) = {l^{x)^h{x),la{x)) at the standard case (/U,(t) = (0,1) are given by 
le{x) = I~^{6)hg{x), where hg{x) are 



(2.10) 
(2.11) 
(2.12) 



hfj,{x) 

ha{x) 
haix) 



1 



2vr/(x;a) 
a 



27rf{x;a) 
1 



zfe-*'^e-l'l dt, 



^-itx^-\t\ i^ja^^^ 



-00 
00 



log \t\dt. 



2Trf{x;a) 

Concerning EISE we can employ standard theory of [/-statistics. The proof is given in Appendix 1X1 

Theorem 2.2 Define a 3 x 3 symmetric matrix 

(2.13) A = A{a) = A(e)|(^,,)=(o,i) 



by 



A12 
Au 



A,2 = 0, 



e-'^\^\"t'^w{t)dt, 



A22 = I e-2|*l"|t|2"^«(t)dt, 



A23 
A33 



00 
00 

a I e 

-00 



log\t\w{t)dt, 



e-2|*r|t|2"(iog|t|)-^„(t)dt, 



and define hQ{x) = {h^{x), hf^{x), haix))' by 

/oo 
tsm{tx)e~\^\°'w{t)dt, 
-00 

/oo 
(cos(tx) - e-l*l")e"l*l"|t|°tt;(f)dt, 
-00 

/oo 
(cos(te) - e^l*l")e"l*l°|t|"log|t|tt;(t)(it. 
-00 

For EISE the AL representations le{x) = (l^ix), hix),la{x)) at the standard case {fi, a) = (0, 1) are given 
by 



(2.17) 



le{x) = A{a) 'he{x) 



and their asymptotic covariance matrix at the standard case is given by 

/ H,, \ 

A-'E[he{X)h'g{X)]A-'' = A-n H^^ \ A~^' = 
















Hera 










Ju 











J22 


J23 





J32 


J33 



where each element of H = E[ho{X)hQ{X)] is 

/OO /"OO f -l ^ 
J |^(e-l^-*l" - e-l^+*l'^)| e-(l^l'^+l*l'^)st w{s)w{t)dsdt, 

/CO /"OO -1 ^ 

/OO f'OO ( -\ ^ 
y Ji(e-l«-*r +e-l^+*l'^)-e-(W''+l*l'')U-(l^l"+l*l")|striog|s|log|^^ 

Note that the above AL representations and asymptotic matrices involve definite integrals, which 
require numerical integration. But for some special cases like Cauchy (a = 1) we can calculate several 
integrals analytically. Analytic expressions are useful for checking correctness of numerical calculations 
concerning Theorem 12. II We give the following Corollary for the case of a = 1 and MLE. 

Corollary 2.1 Let 7 = 0.577216 denote Euler constant. In the Cauchy case (a = 1) and MLE, at 
(/U, cr) = (0,1), the AL representations are given as le{x) = I~^hg{x)\a=i, where 



ha{x) 



2x . _ - 1 

1 



hfji{x) — 2 I 1 ' hcr{x) — 2 I 1 

+ 1 + 1 



1-X2 , 2 



x2 + 1 



- log(x' + 1) - 1 + 7 



2x 

H — ^ arctan x, 

x^ + 1 



hi=l22 = \, /23 =/32 = ^(l-7-log2), l33 = ^|y + (7 + log2-l)2 
A similar result is given in Section 6 of Matsui and Takemura (2006) and the proof is omitted. 
2.2 Asymptotic theory of the proposed test statistics 

In this section theoretical results on asymptotics of the proposed test statistic Z?n,K are obtained. From 
another expression of Dn./t (|1-3|) we derive weak convergence of Zn{t) and weak convergence of test statistic 
Dn^K in the following two theorems. These results correspond to those of Cauchy case stated in Matsui 
and Takemura (2005) where parameter a = 1 is fixed. As a special case we also describe the Cauchy case 
involving estimation of a in the corollary below. Furthermore a general formula of asymptotic covariance 
function of the empirical characteristic process with parameters estimated by an efficient estimator is 
given in the latter part of this section. As already remarked several times, we can assume without loss 
of generality that Xi, . . . ,Xn is random sample from f{x;a) because of affine invariance of our tests. 
Following Giirtler and Henze (2000) we use the Prechet space C(R) of continuous functions on the real 
line R for considering the random processes. The metric of C(R) is given by 

00 



pix,y) = '^2-^ 



^ 1 + Pj(x,y)^ 
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where Pj{x,y) = max^Kj \x(t) - y{t)\. 

We first give the asymptotic covariance function of the empirical characteristic function process with 
parameters estimated by MLE and EISE. In the fohowing theorems the elements of the inverse of the 
Fisher information matrix 1(9) and the matrix A in (|2.1.S)) are denoted with superscripts and A^^ . 

Theorem 2.3 Let Xi, . . . , Xn be i.i.d. f{x; a) random variables and let Zn be defined in l \1.4l - Then Z„ 

Z in C(R), where Z is a zero mean Gaussian process with covariance functions given below. 
MLE : 



(2.18) 



r(s,t) = e-i*-^i"-e-(i*i"+i^n_|jii,t + j22|^^|a^2 

+/23|st|-a log \st\ + I'^\str log \s\ log |t| |e-(l*l"+l^l"). 



EISE : 

r(.,i) = e-l*-l"-e-(l*l"+N") 

+ {Just + J22a2|st|" + J23a|sfr(log \s\ + log \t\) + Jask^r log \t\ log |s|}e-(l*l"+l'^l") 

+ {{B^A^' + B^A'')a{\t\'' + \sn + (B^A^^ + B^A'^){\tr log \t\ + Is^ log \s\)} e-(l*l"+l^l") 

POO 



oo 



/oo 
e\'-^\''-^^\"\u\''w{u)du[2] 
-oo 

/oo 
e\'-^\''~\<\u\''\og\u\w{u)du[2] 
-oo 

where [2] after a term means symmetrization with respect to s andt, i.e., 5(s,t)[2] = g{s,t) + g{t, s), and 

a TOO ^ 

e~^'"' lul"^ log \u\w{u)du, Bfj = a e~^'"' \u\'^w{u)du. 

-oo J —oo 

As a corollary the asymptotic covariance function for the case of fixed a is given as follows. 

Corollary 2.2 Under the same conditions of Theorem \2.Sl when a is fixed, Zn{t)-^ Z in C(R), where 
Z is a zero mean Gaussian process with covariance functions given below. 
MLE : 

r(,,t) = e-l*-^l" -e-d^l'^+N'^) _ (/ii,t + /22|^^|a^2)g-(|r+N"). 

EISE : 

r(s, t) = e-l*-«l" - e-(l*l'^+l^l") + {Just + J220?\str + ^''i?.a(|r + |sr)}e-(l*l"+l^l'^) 

e-l^-"l"~l"l"nu;(n)dn[2] - A^''a^\tre~\'\" / e-\'-^\''-\'^\"\urw{u)du[2]. 

-oo J —oo 

The following corollary gives the asymptotic covariance function when the true distribution is Cauchy 
C{fi,a), but the characteristic exponent a is estimated by MLE. 
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Corollary 2.3 Let Xi, . . . , Xn be i.i.d. C(0, 1) random variables and let Zn be defined in l \1.4h where 

parameters are estimated by MLE. Then Zn Z in C(R), where Z is a zero mean Gaussian process 
with covariance function given below. 

(2.20) Tcis, t) = e-l*-^l - {1 + 2{st + |sf|)}e-l^l^l*l 

12 

^ {log \s\ + (7 + log2 - 1)} {log \t\ + (7 + log 2 - 1)} |st|e-l"l-l*l. 

Furthermore for efficient estimations including MLE, we can derive more general result after some 
formulations. We assume parameter space is p-dimensional. First, WG d.GfinG cin GfficiGiit Gstinicitor 0^ 
of 9q as such that 

i=i 
1 " 

(2.21) := _ V/£;(X, ;0o)+en, 
p 

where e„ — > 0. This definition coincides with formula (20) of Durbin (1973b), though his definition is 
given for nuisance parameters. He also gives conditions that the estimator On satisfies formula (|2.2H) under 
more general arguments including local alternatives (see the condition (A3) of Durbin (1973b)). Note 
that Durbin (1973b) considers estimation only for nuisance parameters On including local alternatives 
and the parameters of interest (the null hypothesis) are not estimated. However in our case the whole 
parameters are estimated without nuisance parameters because the null hypothesis of Hi is the whole 
parameter space. 

Second, we modify the conditions (iv) of Csorgo (1983) as 
{iv)* Ie{x]0) in h2.21\) is a p- dimensional Borel measurable function, E\Ie{Xi;Oq)\ = (0, ...,0), and 
= E\Ie{Xi;0q)1'^{Xi;0q)\ is finite and positive definite. 

Theorem 2.4 Let Xi, . . . ,Xn he i.i.d. F{x;Oq) random variables and let k{x,t) = cos(tx) + isin(tx). 
Consider the kernel transformed empirical characteristic process 

(2.22) Zn = j kix,t)d{V^{Fnix)-F{x;On))}, 

where Fn{x) denotes the empirical distribution function. Then Zn — Z in C(R) under the conditions 
(i)* , (ii)* , (v) and (vi) of Csorgo (1983) where 1{-;0q) is replaced by Ie[-'-,Oo) o,nd {iv)* . Here Z is a zero 
mean Gaussian process with covariance function 

(2.23) T{s, t) = $(s - t; ^o) - Hs; Oo)Mt^ - VeHs; Oq)' L'\0)V e^t; Oq) , 

where <I>(t;0o) 'is the characteristic function and 'V0^{t;Oo) is the derivative of ^{t;Oo) with respect to 
parameter vector 0. 

Theorem 12.41 is interpreted as the Fourier kernel transformed version of Theorem 2 of Durbin (1973b) 
with nuisance parameters corresponds to the estimated null hypothesis. In our subsequent works we will 
consider extension of Theorem 12.41 to the case of local alternatives. 

Finally we state the following theorem concerning the weak convergence of -Dn.K- 
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Theorem 2.5 Under the conditions of Theorem 2.1 

-oo J — oo 

The proofs of the above theorems are given in Appendix^ We omit the proof of Theorem 12.51 since 
after obtaining Theorem 12.31 the proof is the same as that of Theorem 2.2 of Giirtler and Henze (2000). 

3 Approximation of the asymptotic critical values of the proposed test 
statistics 

In this section we investigate the distribution of for MLE. We briefly explain how to obtain the 
characteristic function of D^- Detailed treatments of this approach in statistical applications are given in 
Tanaka (1996) or Anderson and Darling (1952). Since the characteristic function of contains infinite 
product of functions of eigenvalues which can not be evaluated analytically, we approximate eigenvalues 
by theory of homogeneous integral equations of the second kind and the associated Fredholm determinant. 
Then utilizing complex integration, we invert the characteristic function and obtain series representation 
of the distribution of -D^- Detailed theoretical argument of inversion process is given in Slepian (1957). 
Actual computational approximations are given in the next section. At the end of this section we transform 
our kernels T{s,t) on to kernels K{s,t) on [—1, 1]^ for convenience in numerical computation. 

In this paper we omit consideration of of EISE since kernels T(s,t) have definite integrals and 
we need many numerical approximations. On the other hand for the case of MLE we can utilize past 
researches on Fisher information in DuMouchel (1975), Nolan (2001) and Matsui and Takemura (2006) to 
confirm the accuracy of our computation. For k > 1 we can use the following standard form of Mercer's 
theorem. 

Theorem 3.1 (Mercer's Theorem, Chapter 5 of Hochstadt (1973)) Let K{s,t) be the kernel of 
a positive self-adjoint operator on 1, 1] and suppose that K{s,t) is continuous in both variables. Then 

oo ^ 

(3.1) K{s,t) = y2-fj{s)fj{t), 0<Ai<A2<---Too, 

where Xj is an eigenvalue and fj{t) is the corresponding orthonormal eigenfunction of the integral equation 

(3.2) A l'^K{s,t)f{t)dt = f{s). 
The series ()3.1() converges uniformly and absolutely to K{s,t). 

If K < 1 we need to deal with kernels which are not continuous at two points (—1,-1) and (1, 1). We can 
see discontinuity at (—1, —1) and (1, 1) in Figures ^ 01 and [S] in the case k = 1. On the other hand there 
is no discontinuity at these points for k = 2.5 as shown in Figures 121 and El However as in Anderson 
and Darling (1952) the following version of Mercer's theorem by Hammerstein (1927) is useful. 

Theorem 3.2 Suppose that the covariance function K(s,t) of a Gaussian process is continuous except 
at (—1,-1) and (1,1) with dK{s,t)/ds continuous for \s\,\t\ < l,s ^ t, and bounded in \s\ < 1 — e for 
every t £ [—1, 1] and every e > 0. Then the right hand side of (|3.1|) converges uniformly in every domain 
in the interior o/ [— 1, 1]^. 
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We apply the above theorems to a continuous covariance function K{s, t) of a zero mean continuous 
Gaussian process Z{t), —l<t<l, with a finite trace J^-^ K{t,t)dt < oo. Let Xi, X2, ■ ■ ■ , be i.i.d. 
standard normal random variables. Then the series 

00 ^ 

converges in the mean and with probability one for each t G (—1,1). Then Y{t) is a Gaussian process 
with Ey(t) = and E[y(t)y(s)] = K{s,t). Thus Y(t) defines the same stochastic process as Z{t). Let 



(3.3) 



■^-1 1- j=i V^J J j=i ^3 



Y^{t)dt 



1 



The characteristic function of W'^ is given as 

JuW^ 



E(e" 



E[exi>{iu^X]/Xj)] = ]jE[exp(mX|/Aj)] = ]^(1 - 2iu/Aj)^5. 
j=i j=i j=i 



The characteristic function has an alternative expression 1/y^ D{2it) where -D(A) is the associated Fred- 
holm determinant 



.7=1 ^ ^ ^ 



There are two problems in treating the characteristic function in the form of the Predholm determinant. 
One is in the approximation of D(X) itself and the other is in the Levy's inversion formula. 

In the case of stable distributions the Fredholm determinant can not be explicitly evaluated and 
approximation of -D(A) is needed as in the Cauchy case in Matsui and Takemura (2005). We approximate 
D{X) by discretizing the homogeneous integral equation and approximating eigenvalues of resulting finite 
system of linear equations. Then the integral equation 1)3.2(1 is approximated by the following finite system 
of linear equations 



N 



where 



K 



Then the Fredholm determinant is approximated as 















f = 













Dn{X) 



I--K 

N 



N 



where l/Xj = l/Xj{N) are the eigenvalues of K/N. This method is called a quadrature method and we 
state a version of Theorem 3.4 of Baker (1977) concerning the convergence of eigenvalues. 
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Theorem 3.3 Let the eigenvalues Xj{N) be obtained by the quadrature method. If K(s,t) is positive 
definite and continuous in s,t & [—1,1], 



for each j and 
for each A. 



lim Xj{N) = Xj, 



lim Dn{X) = D{X) 

TV— >oo 



Remark 3.1 The covariance functions in (|3.7|) and (|.S.8|) below do not satisfy the conditions of this theo- 
rem if K < 1. However this theorem gives only a sufficient condition for the convergence. In our problem 
the values o/Z)jv(A) seem to converge as we increase N even for the case k<1 and the resulting value is 
consistent with our Monte Carlo simulations. Therefore in the next section we use the approximation of 
this theorem even for the case k<1. It remains to theoretically prove that the approximation is valid for 
the case k <1. 

The probability density function of the proposed statistic is given by inverting the characteristic 
function = 1/ D{2it). Since integrand of inversion formula is often wildly oscillating and converges 
to slowly, the ordinary numerical integration is difficult (Section 6.1 of Tanaka (1996)). However we 
can utilize theory of complex integration in Slepian (1957) and invert $(f) very efficiently. This method 
of inversion does not seem to be commonly implemented in statistical computations. 

Assuming that the kernel K{s, t) has no multiple eigenvalues and the number of the eigenvalues are 
infinite, the density and distribution of are calculated as 



/d(x) 



vr 



-1/ 



k=l 



-xy 



n 



oo 



--dy, 



Fd{x) 



vr — ' 



fe=i 



-xy 



--dy. 



1 



The series are alternating and convenient for checking convergence. Although the above representa- 
tions have singularity at each endpoint of integral range, we can remove the singularity by the following 
transformation y ^ z as m. Slepian (1957). The k-th integral is transformed by 



(3.4) 
Then 



y 



^2k ^2k~l\ , 1 f X2k , A2fc-1 
COS TTZ -\ 

2 2 2 \ 2 2 



< z < 1. 



dy 




^2k-^ 



Hence we obtain the following representations suitable for numerical integration. 



(3.5) 



oo „ 
k=l ^0 



-xy 



^2k _ r.. 

2 y 



dz. 



n 
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(3.6) Fn{x) = l-Y,i-^)' / 



\ 2 ^7 V 2 



y\nW=i 



where y is given by formula (|3.4)1 . 

Finally we will make a transformation of variable and map into [—1,1]^ in order to satisfy the finite 
interval condition of Mercer's theorem. This transformation also is useful for numerical approximation of 
eigenvalues. For deriving the distribution of D^, we have to incorporate the weight function e"'''*' into 
the kernel, i.e., we consider the following kernel 

r(s,f)e-t(H+l*l). 

Now we make the transformation s ^ u defined by 

s = — sgn u • log(l — — 1 < n < 1. 

Then ^ 

ds = ; — :du. 

1 - |n| 

The kernel and the eigenfunctions are transformed as 

r(.,t) ^ ^(^^^)^r(-sgn_^.log(l-H),-sgn^;.log(l-H)) 



V(i-H)(i-H 

f I -s fj{-sgnu ■ log(l - \u\)) 



u 



Eigenvalues of ()3.2() do not change by this transformation and so does Fredholm determinant. After this 
transformation, writing s,t instead of u,v again, we have the following kernels on [—1, 1]^: 

(3.7) Hi:K{s,t) = |e-|-'^g'^^-i°g(i-l*l)+sg'i*-i°g(i-l*l)l" - e"! 

— (^I^^ sgns • sgnt • log(l — |s|) log(l — |t|) 

w/22|iog(i-|5i)iog(i-|t|)r 

+a/23| iQg(i _ 1^1) i„g(i _ I iQg(i _ 1^1) i^g(i _ 1^1)1 

+/33| log(l - |s|) log(l - \t\)r log I log(l -\S\)\- log I log(l - |t|)|) 

xe-IMi-N)l'^-|iog{i-|*|)|'^ 1 _ _ 1^1)} V. 

(3.8) H2:K{s,t) = |e-|-'5gn^-log(i-|s|)+sgnMog(i-|i|)r _ g-| log{i-|s|)r-| log{i~|t|)r 

-(/"sgns • sgnt • log(l - |s|)log(l - \t\) +a^I^'^\ log(l - |s|)log(l - 

xe-|i°s(i-N)r-|iog(i-|t|)r I {(1 _ _ 



13 



Figure 1: MLE-Hl (a = 1.0, ac = 1.0) 



Figure 2: MLE-Hl {a = 1.0, k = 2.5) 




3.1 Numerical approximation of critical values of 

We approximate the eigenvalues in ()3.6|1 by the quadrature method for the kernels ()3.7() and ()3.8() . 800 
eigenvalues are calculated by the above simple algorithm for the case of {k = 1.0,2.5,5.0, 10.0}. We do 
not consider k = 0.1 and 0.5 since convergence of infinite integral of -Dn.K becomes very slow for small 
weights and we had numerical difficulties. In Matsui and Takemura (2005) the approximated sum of 
X^j^i did not converge to E[Z)k] first for small k. We mention that, unlike the Cauchy a = 1 case, 
we did not observe multiple eigenvalues for other symmetric stable distributions (a ^ 1). 

The infinite series and infinite products in (|,3.6|) have to be approximated by a finite sum and finite 
products. Let / and m (l < m) denote the number of terms in the sum and the products respectively. 
Then we can approximate Fd{x) as 



The series is alternating. Therefore the range of the critical value can be obtained by substituting 
lower bound of each positive term and upper bound of each negative term separately. However deriving 
analytical bound of integral of each term of series is difficult. Hence for accuracy of approximation of 
Fd{x) we depend on numerical confirmation. First, we found that finite interval quadrature (QAG) is 
very accurate if we set relative error bounds below 10~^ and the convergence of series is very fast. The first 
10 terms of series are enough to obtain 1% relative accuracy for most k and x large enough to calculate 
critical values. Further for m > 300 the value of Fd{x) does not change with m for most k and large x. 
Finally the approximated percentage points (10% and 5 %) coincide with simulation results in Section^ 
We give Tableland Table [21 for approximate percentage points of under hypothesis Hi and H2 
respectively. Intervals of a G (0.5,2.0) for Hi and a G (0.5,2.0] are 0.1. In each table, we set m = 500 
for K < 5.0 and m = 300 for k = 10.0, and set I = 25 for n < 2.5 and / = 10 for k > 5.0. Trial and 
error indicates that since for large k is very small, we use only accurate large values of 1/Aj among 
800 values. We also plot the percentage points of each under Hi in Figures 171101 The values of 
percentages are continuous with respect to a 7^ 2. For large values of k, percentage points of small a are 
large compared to that of large a. 

4 Computational studies 

In this section we give various computational results. Since the exact finite sample distributions are 
difficult to obtain, first we approximate the percentage points of under Hi and H2 respectively by 
Monte Carlo simulation. Then the power of testing H2 for the finite sample is evaluated. 

4.1 Maximum likelihood estimation 

For MLE we maximize likelihood function in parameter space — oo<//<c>o,fT>0,0<a<2by 
utilizing the first derivatives of each parameter. Maximizations are done by the method based on M.J.D. 
Powell's TOLMIN from IMSL library. Although explicit forms of the density and the derivatives are not 
available for stable distributions, the method suggested by Matsui and Takemura (2006) which improves 
the original method of Nolan (1977,2001) gives very accurate approximations. We use median as the initial 
value for /i and for a and a we do grid search and obtain the parameter values which maximize median 
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inserted log-likelihood among, say, 2000 points. Note that although we can set a = 1 for the initial value 
of Q, the convergence is slow compared with grid search based initial values. Base on 1000 Monte Carlo 
replications, the values of the estimators and the simulated information matrices I {6) = Cov[9i,6j]~^ are 
given in TableElfor a S {0.8, 1.0, 1.5, 1.8, 2.0} and the sample size n G {50, 100, 200}. We put true values 
at the upper row of each values of a in Table 01 Except for a = 2, simulated values of I{6) coincide 
with theoretical values I (9) which are given in Matsui and Takemura (2006). Though information of a 
at a = 2.0 is oo and asymptotic normality is not guaranteed (DuMouchel (1973)), we can estimate a 
computationally at a = 2. Interestingly we observe a = 2.0 for 80%-90% of the cases for n > 100 and 
we also observe some downward bias. 

4.2 Finite sample critical values of D„ ^ 

We omit the case a = 2 for both Hi and H2, since there are many papers concerning testing normality, 
e.g., Henze and Wagner (1997), Csorgo (1986,89) or Naito (1996). Further we investigate -Dn,K only 
a = 1.0, 1.5, 1.8 for convenience. More extensive simulation studies of -Dn,K for other values of a are left 
to our future works. 

We can compute Dn^^ by (|1.2() . when the values of the estimators have converged. Based on 5000 Monte 
Carlo replications, the upper 10 and 5 percentage points of the statistics Dn,K, 1^ S {1-0, 2.5, 5.0, 10.0} are 
tabulated in Tables |lJ|Slini El for Hi and Table 151 1^ ITUl ITTl for H2. We tabulate simulated values in upper 
row and asymptotic values in lower row in box of each value of a. In the tables of Hi when a and k are 
large, the convergences of Dn.K to D^^ are slower than other values of a and n. This tendency is also seen 
in the tables of H2. This is explained as follows. Since behavior near origin of the characteristic function 
reflects behavior of the tail of distribution, the convergence of the empirical characteristic function near 
origin may be faster than that at distant values when the tail of distribution is heavy. However we find 
the values of -Dn,ft converging for all values of a and k as n — > 00. 

4.3 Analysis of finite sample power 

In this section we examine finite sample power under H2. For alternative hypothesis we consider Student's 
t distribution with j degrees of freedom for j = 1, 2, 3, 4, 5, 10, 00, (t(l) = C(0, 1) and t{co) = A^(0, 1)), 
since stable models are sometimes compared with Student's t models in empirical applications. We 
consider only a = 1.5 and a = 1.8 for null distribution of H2 because the simulation studies are heavy 
when many values of a are considered. Investigations of other values of a are also left to our future works. 

For the significance levels C, = 0.1,0.05, finite sample power of the proposed tests are tabulated in 
Table IT^ and [T31 based on 1000 Monte Carlo replications. We summarize our findings. For a = 1.5 the 
test with K = 10.0 has poor power compared with other values of k and the test with k = 5.0 is the most 
powerful. When alternative hypothesis is i(3) or near t(3) finite sample power is not good. For a = 1.8 
the test with k = 10.0 has also poor power as in the test for a = 1.5. While the test with k = 5.0 is 
the most powerful for heavy tail alternatives, the test with k = 2.5 is more powerful for the light tail 
alternatives. These results are interesting because although the tails of t{i) and f{x;a) are different, the 
distributions are not distinguishable well in finite samples. 

Finally make a remark on how to perform a test of Hi. The problem is that even the asymptotic 
null distribution under Hi depends on the true value of a. In the following remark ^ denotes the 
observed valued of the test statistic for Hi and Z)„^K(?;a) denotes the upper ^ percentage points of the 
null distribution of -Dn,/t for Hi when a is the true characteristic exponent. 

Remark 4.1 We can consider several procedures for Hi. 
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1. Hi is rejected if D^ ,. > sup„g(o,2] -D„,K(^;a), 

2. We consider a G [a,b] where [a,b] C (0,2) is a fixed range and put Hi : a £ [a,b]. Then Hi is 
rejected if D^ ,^ > sup„g[„ 5] Dn,^{^;a). 

3. We plug in the estimate dn in Dn^i^{S,', dn) and Hi is rejected if > Dn^ni^'idn)- 

Though procedure 1 is logically correct for finite sample, it has the drawback that the null hypothesis 
with small true characteristic exponent qq may be rejected by comparing ^ to percentage points near 
a = 2 (See Figures ^ 1(J\) . Therefore procedure 2 and 3 are also worth considering. From the viewpoint of 
asymptotic theory we can use procedure 3. We may use Procedure 2 considering the standard error in dn- 

A Proof of Theorem ^17^ 

For simplicity we definite some constants and functions. 

w" {x) := d^w{x) / dx"^ , wi{t) := + w{t), W2{t) := a\t\°'w{t), 



- / e"2|*l"u'i(t)dt, C2 = / e-2|*l"u'2(t)dt. 



' — oo 



Expanding the estimating equation Qe{6) = {Q^{0),Qf^{9),Qa{0)) = around the true parameter 6q 
(0, 1, a), we have 



where 0* is some value between and 9n- We can write 

1 " 

V^Q^ie) = -—Y^gi{X,), 



n 



(A.l) 



[ j,k=l j=l J 

1 " 

7 = 1 



where 



gi{x) = / sin(ix)ie w{t)dt, 

J —oo 

g2{x) = |cos(te) -e-l*l"| (|t|"log|t|e-l*l")ti;(i)dt, 

/oo 
cos(t(a;i — X2))wi{t)dt, 
-oo 
;"00 

h2{x) = / cos(tx)e~'*' {wi{t) — W2{t))dt — ci + C2, 
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2^JnQcr{0) can be expressed in the form of a [/-statistic 



n 



1 " 1 

— V 2hi{Xj,Xk) } = V^Un + rn, 

n^(n — 1) ^-^ I 

^ ' 3<k ) 



Tn 0, 



where 



Un 



n{n — 1) 



^ E {hi{Xj,Xk)-h2{Xj)-h2{Xk)} 



l<j<k<n 



-1 n 



J2 h{X,,Xk). 
l<j<k<n 



By standard argument on [/"-statistics (Chapter 3 of Maesono (2001), Chapter 5 of Serfiing (1980)) we 
only need to evaluate 

a{xi) = E[h{Xi,X2)\Xi=xi], 



smce 



1 

7 = 1 



0. 



Calculating 



/oo 
cos(tei)e"l*l"u;i(i)(it 
-oo 

and E[h2{X2)\Xi = xi] = E[/i2(^2)] = ci, it can be shown that a{xi) is written as 

/oo 
cos(tei)e~'*'"w2(0^^ ~ ^2- 
-oo 

p 

After showing the convergence of second derivatives dQg{6*)/d6 — > A we can obtain AL representations, 

V^{en-eo) = -A-^V^Qe{eo). 

p 

The proof of dQ{6*)/d9 — > A is as follows. As before the derivatives are evaluated at (/x, a) = (0, 1). 
Write 



da 



{w"{t)\t\'' + 2w'{\t\)\t\)dt 



dQa{0) 

da 



1^ PCX) ^ 

/ cos{tXj)fe~\*\°'w{t)dt, 
n .^^ 7-00 

7:2^2 cos(i(a;j - Xk)) - - XI cos(iXj)e-l*l" + e'^ 

j,k 3=1 
-1 " /-oo 

(cositxj) - e-l*l'')e-l*l''|t|" {(-a^l^l" + + a)w{t) + 2aw'{t)\t\} dt 

j=i -^-^ 

/oo 
e-2|*r|t|2"ti;(t)dt, 
-oo 

- E / (cos(tej) - e-l*l")e-l*l"|t|" [{log |t|(-a|i|" + a + 1) + 1} w{t) + log |t|w'(t)|i|] 

/oo 
e-2|*l"|i|2" log\t\w{t)dt, 
-oo 
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OQaje) 

da 



1 . ^ . 

- ^ / (cos(te,) - e-l*l")(l - |ir)e-l*l"|tr(log \t\fw{t)dt 

^ 1 J — CO 



3=1 

/oo 
e-'^\r\t\^<^{\og\t\fw{t)dt. 
-oo 

Making use of E[cos(tX) — e~l*l"] = and by Fubini's theorem we can calculate their expectations, 



E 
E 



d/j, 
da 



da 
da 



/oo 
-oo 

1 

-Jjl-e-'i'\"){w"it)\t\' + 2w'm\)dt 

/oo 
e-^\'\"\t\'"wit)dt, 
-oo 

/oo 
e-^\^\"\tf''log\t\w{t)dt, 
-oo 

/oo 
e-2|*l"|t|2"(log|t|)2^t;(t)c/i. 
-oo 



E[9(50.(6')/90j] = for other parameters 9i since the density is symmetric. By the weak law of large 
numbers and continuity of integral about parameters we can finish the proof. □ 



B Proofs of Section 2.2 
B.l Proof of Theorem 2.3 

The idea of proofs is essentially the same as those of Giirtler and Henze (2000) based on the original proof 
of Csorgo (1983). However, since parameter a is additionally estimated and stable distribution f{x;ao) 
dose not have the a-moment E[|X|"], a > ao > 0, we reproduce here the outline of the whole proof. 
Note that the kernel k{s,t) is changed from Giirtler and Henze (2000). Before considering Frechet space 
C(R), we first assume the restricted space C{S) of continuous functions on a compact subset S with the 
supremum norm ||/||oo = sup^g^ |/(i)l- Letting k{x,t) = cos(tx) +ism{tx), an alternative representation 
of Zn{t) is given by 




(cos(i/4) + ism{tiSn)) 



= J kix,t)d{V^{Fn{x) - F{x;9n))} ■ 

This is the form of kernel transformed empirical process. We have to check the condition of (i)* (ii)*, 
(iv), (v) and (vi) of Csorgo (1983). Condition (i)* is satisfied from the definition of the kernel k{x,t) = 



19 



cos(tx) + ism{tx). Condition (ii)* is easy following Giirtler and Henze (2000). For < e < uq, 

\ k{x, s) — k{x,t) \ = \J (cos(sx) — cos(tx))^ + (sin(sa;) — sin(tx))^ 



= \/2Vl - cos((s - t)^) 
= 2 I sin((s-t)x/2)| 

< 4|s-^|'/V^''^ 

and £[1X1*^ < co]. For condition (iv) we can check that the covariance matrix E[/(X)/(X)'] for MLE and 
EISE are finite and positive definite from Theorem 12 . II and Theorem 12.21 Condition (v) for EISE is easy 
because Ig are bounded and continuously differentiable from Theorem 12.21 However for MLE we need 
some argument since = fg/ f of Theorem 12.11 has no explicit form. For any compact set IT G R if 
X G K, f{x;a) and fg{x;a) are continuously differentiable with respect to x. Thus hg is exists almost 
everywhere and finite x £ K. To see this we differentiate Fourier inversion formula directly and confirm 
its integrability. Note that the density f{x; a) has no singularity. However as x ^ oo we have to consider 
the tail orders of f{x; a) and /e(x; a) going to 0. We utilize asymptotic expansions of f{x; a) and fg{x; a) 
in Matsui and Takemura (2006): 



f{x;a) 
f'{x;a) 
fa{x;a) 



I f; £(^^(_i)^-i sin(^)x-^"-i 

k=l 
^ oo 



T(ak + 2) 



k=l 

oo 



A;! 



1 ^r{ak + l] 



sm 



+-E 

vr ^ 



(-1) 





' Tiak 




'( 




>- 








vr 




irak 




cos 1 




2 




2 



ka—l 



logx sin 



Irak 



-ka—l 



From expansions and the following relations 

/^(x; a) = -/'(x; a), f„{x; a) = -/(x; a) - x/(x; a), 

we can confirm that Zg's exist almost everywhere and finite except for /i^ at x = oo since ha{x) = 0(log x) 
From the expansions we see that another condition of (v), i.e. 



(B.l) 



Vi{u) = sup ||/(x;0°)| + 

\x\<u L 



< OO, Vn > 0, 



is also satisfied. Note that in the proof of Theorem at p. 527 of Csorgo (1983) non-existence of at x = oo 
is permissible. Therefore condition (v) holds. 

Concerning condition (vi), for symmetric stable distributions elements oiVg^it; 9) = (<I>^(t; 0), <I>(j(i; d), 
^a{t;6)) are written as 

= ^iiii^iill^ = {_sin(^t)+icos(/it)}te-H", 
a/i 

$,(t; 9) = ^^(^i^"^/^'^) = _{cos(^t) + i sin(/it)}e-l"*l" |atra/cT, 

= 9^(^;Q,/^,^) ^ _{cos(^t) + isin(^t)}e-l"*l"|at|"log|at|. 
oa 
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Putting (/Lt, (7, a) = 00 = (0, 1, a) we obtain 

Vemf^o) = a), a)) = (ite"!*!", -e-l*l"|tra, -e"!*!"!*!" log 

Because Vg^{t; 9) is continuous and bounded if {t, 9) e S x Qq, where ©o is a closed parameter space 
sufficiently near the condition is satisfied. Therefore the weak convergence of Z„(t) to a zero mean 
Gaussian process Z is proved in the space {C{S), \\ • \\qo). Since the compact set S is arbitrary, the space 
{C(S), II • I loo) can be extended to Frcchct space C(R) easily. 

The kernel transform of AL representations are given as follows. Write Fq(x) = F(x, 9o) for simplicity. 

1. MLE : 

(B.2) r k{x, s)lg{x)dFo{x) = I-\9)VeHs; 9). 

J —oo 

2. EISE : 

(B.3) j k{x,s)le{x)dFo{x) = A-^ J k{x, s)he{x)dFo{x), 

where 

//•oo 
k{x,s)hf,{x)dFQ{x) = i / e'^'-''^" '^'^^'^ uw{u)du, 
J —OO 

//"OO 
kix,s)haix)dFo{x) = S^e-l*l"-a/ e-l*-"l"-H"|n|"w;(ti)dM, 

/poo 
k{x,s)haix)dFo{x) = 5«e-l"l" - / e-l"-"l"-H"|y|«iog 
J —oo 

Let (•, •) denote the standard inner product of . Write 

^t;9n) = mdo)-{0n-Oo,Vem9*J) 

= J k{x,t)dFo{x) - {9n - 9o,VeHt,9*J), 

where 0* is some value between 9o and 9n- Note that ^* — > 9q. Now replace \fn{9n — 9o) by its AL 
representations. Then Zn{t) is written as 

Zn{t) = J k{x,t)d{V^{Fn{x) - Fo{x))} - (^V^{9n- 9o),VeHt;9*J 
= Z:{t) + A(^\t) + A(^\t), 

where 

Kit) := J fc(x,t)a!{^/^(Fn(x)-Fo(x))} -^-^f^^,(x,),V,$(t;^o] 

(2) (3) 

Z* also converges to Z. The remainder terms A„ ' and A„ ' are defined by 



A(f) := (^V^{9n-9o),V0mOo)-^emdn)), 
A(f) := -{€n,Vg^t;9o)) , e„ = (r„i, r„2, r„3)'- 
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These remainder terms satisfy sup|A^^^| — > 0, and sup|A^^^| — > by conditions (iv) and (vi) of 

tes tes 
Csorgo (1983). The asymptotic process Z has an alternative expression 



Z{t) = j k{x,t)dBFo{x)- (^j l0{x)dBF,-,{x),VeHt;9o 



where Bf^^^x) is the Brownian bridge corresponding to the distribution function Fq, having covariance 
function 'E[BFQ{s)BFQ{t)] = Fq{s A t) — Fo{s)F(){t). Z* and Z have the same covariance function 



(B.4) T{s,t) = <!>{s-t;eo)-Hs;eoMt;eo) + ^s,eoyEMXi)ie{x,yMt,do 



{Ve<S>{s;eo), J k{x,t)le{x)dFoix) j - ^Ve$(t;^o), / k{x, s)le{x)dFo{x) ) . 



Note 



j k{x,s)k{x,t)dFo{x) = J k{x, s - t)dFQ{x) = <^{s - t;eo). 
Evaluating ()B.4|) for the case of MLE and EISE using ()B.2j) and (|B.3|1 proves Theorem ESI □ 



B.2 Proof of theorem 12.41 

We have only to show 

(B.5) VeHs; Oo)' rHe)V^¥{t^ = (veHt;eo), j k{x, s)lE{x)dFo{x] 

Because AL representations can be written by I~^{6)dlogf{x;6Q)/d9, their kernel transformations are 
k{x,s)lE{x)dFo{x) = r\9) j k{x,s)^^^^^^p^dFo{x). 

= r\e) I kix,s)-^^i^ f{x-Mdx 

= I-\9)^ je-^^J^dx 

= i-\e)VeHs-eo). 

Since both sides of the formula (|B.5|) are scalars the proof is over. □ 



B.3 Proof of corollary 2.2.1 

Let a = 1. Then I^^{9) is explicitly written as 



2 

r\e)= \ 2 + l|(7 + log2-l)2 l|(7 + log2-l) 

12 , l^„o 1\ 12 



^(7 + log2-l) 

Letting a = 1 in (|2.18|) and replacing I^^{9) by the above matrix we can prove the corollary. □ 
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Figure 7: Upper quantiles {k = 1.0, Hi) 
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Figure 8: Upper quantiles {k = 2.5, Hi) 
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Table 1: Upper ^ percentage points of under Hi 



a 




1.0 


2.5 


5.0 


10.0 


1.9 


0.1 


1.097 


0.1030 


0.00759 


0.000973 


0.05 


1.352 


0.1338 


0.01000 


0.001267 


1.8 


0.1 


1.037 


0.0963 


0.00977 


0.002283 


0.05 


1.273 


0.1240 


0.01257 


0.002984 


1.7 


0.1 


0.991 


0.0958 


0.01375 


0.003974 


0.05 


1.211 


0.1217 


0.01754 


0.005183 


1.6 


0.1 


0.957 


0.1007 


0.01921 


0.006077 


0.05 


1.161 


0.1258 


0.02431 


0.007895 


1.5 


0.1 


0.933 


0.1108 


0.02615 


0.008650 


0.05 


1.122 


0.1361 


0.03280 


0.011180 


1.4 


0.1 


0.918 


0.1260 


0.03469 


0.011770 


0.05 


1.094 


0.1527 


0.04313 


0.015120 


1.3 


0.1 


0.915 


0.1462 


0.04503 


0.015530 


0.05 


1.078 


0.1754 


0.05553 


0.019813 


1.2 


0.1 


0.925 


0.1717 


0.05740 


0.020042 


0.05 


1.078 


0.2041 


0.07025 


0.025370 


1.1 


0.1 


0.948 


0.2027 


0.07206 


0.025434 


0.05 


1.095 


0.2390 


0.08755 


0.031916 


1.0 


0.1 


0.988 


0.2395 


0.08923 


0.031846 


0.05 


1.130 


0.2804 


0.10765 


0.039574 


0.9 


0.1 


1.044 


0.2824 


0.10903 


0.039422 


0.05 


1.186 


0.3288 


0.13063 


0.048454 


0.8 


0.1 


1.118 


0.3315 


0.13145 


0.048287 


0.05 


1.262 


0.3840 


0.15627 


0.058625 


0.7 


0.1 


1.213 


0.3855 


0.15611 


0.058531 


0.05 


1.362 


0.4446 


0.18397 


0.070081 


0.6 


0.1 


1.325 


0.4413 


0.18217 


0.070175 


0.05 


1.482 


0.5065 


0.21239 


0.082722 


0.5 


0.1 


1.441 


0.4928 


0.20834 


0.083189 


0.05 


1.609 


0.5615 


0.23966 


0.096393 
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Table 2: Upper ^ percentage points of under H2 



a 




1.0 


2.5 


5.0 


10.0 


2.0 


0.1 


1.216 


0.1258 


0.00881 


0.000241 


0.05 


1.499 


0.1622 


0.01177 


0.000335 


1.9 


0.1 


1.150 


0.1129 


0.00921 


0.00142 


0.05 


1.413 


0.1444 


0.01164 


0.00179 


1.8 


0.1 


1.110 


0.1111 


0.01354 


0.00329 


0.05 


1.357 


0.1398 


0.01679 


0.00416 


1.7 


0.1 


1.080 


0.1157 


0.01984 


0.00566 


0.05 


1.313 


0.1431 


0.02457 


0.00713 


1.6 


0.1 


1.058 


0.1256 


0.02773 


0.00858 


0.05 


1.277 


0.1532 


0.03426 


0.01076 


1.5 


0.1 


1.044 


0.1404 


0.03721 


0.01211 


0.05 


1.249 


0.1697 


0.04578 


0.01514 


1.4 


0.1 


1.037 


0.1599 


0.04840 


0.01636 


0.05 


1.231 


0.1919 


0.05927 


0.02037 


1.3 


0.1 


1.039 


0.1840 


0.00118 


0.02114 


0.05 


1.222 


0.2195 


0.07493 


0.02660 


1.2 


0.1 


1.051 


0.2127 


0.07670 


0.02748 


0.05 


1.225 


0.2524 


0.09300 


0.03396 


1.1 


0.1 


1.074 


0.2465 


0.09428 


0.03464 


0.05 


1.242 


0.2907 


0.11376 


0.04262 


1.0 


0.1 


1.111 


0.2862 


0.11445 


0.04307 


0.05 


1.276 


0.3356 


0.13742 


0.05273 


0.9 


0.1 


1.159 


0.3305 


0.13723 


0.05291 


0.05 


1.322 


0.3855 


0.16395 


0.06443 


0.8 


0.1 


1.226 


0.3811 


0.16263 


0.06426 


0.05 


1.389 


0.4424 


0.19326 


0.07779 


0.7 


0.1 


1.310 


0.4365 


0.19018 


0.07714 


0.05 


1.476 


0.5045 


0.22466 


0.09278 


0.6 


0.1 


1.412 


0.4936 


0.21890 


0.09144 


0.05 


1.583 


0.5678 


0.25681 


0.10919 


0.5 


0.1 


1.517 


0.5464 


0.24726 


0.10689 


0.05 


1.696 


0.6250 


0.28773 


0.12661 
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Table 3: Simulation results of symmetric stable distributions (1000 iterations) 



n 


a 


A 


a 


hi 


hi 


-^33 


h2 


^13 


hii 




2.0 





1 


0.5 


2.0 


OO 








* 


50 


1.976 


0.00014 


0.977 


0.607 


1.875 


5.647 


-0.029 


-0.010 


-0.828 


100 


1.990 


0.00017 


0.975 


0.868 


1.402 


9.445 


0.032 


-0.057 


-0.594 


200 


1.994 


0.00094 


0.977 


0.784 


1.239 


12.77 


-0.009 


0.078 


-0.539 




1.8 





1 


0.4552 


1.3898 


0.5937 








-0.3138 


50 


1.818 


0.0012 


0.991 


0.487 


1.231 


0.676 


-0.033 


-0.026 


-0.267 


100 


1.822 


0.0033 


1.002 


0.482 


1.356 


0.584 


0.005 


-0.016 


-0.340 


200 


1.810 


-0.0000 


1.000 


0.450 


1.399 


0.603 


-0.007 


-0.003 


-0.323 




1.5 





1 


0.4281 


0.9556 


0.4737 








-0.2174 


50 


1.548 


-0.0022 


1.012 


0.3161 


0.5796 


0.4252 


0.005 


-0.009 


-0.1927 


100 


1.524 


-0.0000 


1.000 


0.3914 


0.9138 


0.4278 


0.001 


-0.010 


-0.2291 


200 


1.510 


-0.0003 


1.000 


0.4028 


0.9474 


0.4683 


-0.018 


-0.006 


-0.2229 




1.0 





1 


0.5 


0.5 


0.8590 








-0.1352 


50 


1.026 


-0.0029 


0.996 


0.4243 


0.4877 


0.670 


-0.004 


0.0248 


-0.1779 


100 


1.001 


-0.0041 


0.988 


0.4438 


0.4205 


0.746 


0.007 


0.003 


-0.1527 


200 


1.006 


-0.0039 


1.001 


0.4929 


0.5013 


0.845 


-0.003 


-0.0251 


-0.1534 




0.8 





1 


0.6800 


0.3586 


1.3928 








-0.0913 


50 


0.815 


-0.0016 


1.005 


0.5434 


0.3243 


1.111 


-0.01432 


0.0015 


-0.083 


100 


0.811 


-0.0001 


1.003 


0.6015 


0.3459 


1.171 


0.00435 


-0.0313 


-0.097 


200 


0.805 


-0.0009 


1.000 


0.6232 


0.3708 


1.303 


0.01785 


-0.0276 


-0.086 
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Figure 9: Upper quantiles (k = 5.0, Hi) Figure 10: Upper quantiles (k = 10.0, Hi) 
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Table 4: Upper 10 percentage points of -Dioo.k under Hi 



a \ K 


1.0 


2.5 


5.0 


10.0 


1.8 


1.037 

1.037 


0.1037 

0.0963 


0.01271 

0.00977 


0.00326 

0.00228 


1.5 


0.953 
0.933 


0.1213 
0.1108 


0.03061 
0.02615 


0.00996 
0.00865 



1.0 1.032 0.2541 0.09450 0.03336 
0.988 0.2395 0.08923 0.03185 



Table 5: Upper 5 percentage points of -Dioo,k under Hi 



a \ K 


1.0 


2.5 


5.0 


10.0 


1.8 


1.279 
1.273 


0.1334 
0.1240 


0.01681 
0.01257 


0.00451 
0.00298 


1.5 


1.141 
1.122 


0.1504 
0.1361 


0.03968 
0.03280 


0.01348 
0.01118 



1.0 1.216 0.3059 0.11400 0.04230 
1.130 0.2804 0.10765 0.03957 



Table 6: Upper 10 percentage points of -D200,k under Hi 





1.0 


2.5 


5.0 


10.0 


1.8 


1.029 


0.1003 


0.01119 


0.00279 




1.037 


0.0963 


0.00977 


0.00228 


1.5 


0.929 


0.1145 


0.02807 


0.00926 




0.933 


0.1108 


0.02615 


0.00865 


1.0 


1.006 


0.2462 


0.09072 


0.03233 




0.988 


0.2395 


0.08923 


0.03185 



Table 7: Upper 5 percentage points of -D200,k under Hi 



a \ K 


1.0 


2.5 


5.0 


10.0 


1.8 


1.290 


0.1292 


0.01467 


0.00386 




1.273 


0.1240 


0.01257 


0.00298 


1.5 


1.125 


0.1419 


0.03575 


0.01222 




1.122 


0.1361 


0.03280 


0.01118 


1.0 


1.161 


0.2887 


0.10938 


0.04015 




1.130 


0.2804 


0.10765 


0.03957 
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Table 8: Upper 10 percentage points of -Diooa under H2 



a\K 1.0 2.5 5.0 10.0 

1.8 1.100 0.1126 0.01415 0.00356 

1.110 0.1111 0.01354 0.00329 

~~Lh 1.030 0.1403 0.03709 0.01202 

1.044 0.1404 0.03721 0.01211 

Table 9: Upper 5 percentage points of -Dioo,k under H2 

a\K 1.0 2.5 5.0 10.0 

1.8 1.333 0.1397 0.01878 0.00533 

1.357 0.1398 0.01679 0.00416 

1.5 1.220 0.1690 0.04710 0.01590 

1.249 0.1697 0.04578 0.01514 

Table 10: Upper 10 percentage points of -D200.f, under H2 



a\K 1.0 2.5 5.0 10.0 

1.8 1.077 0.1108 0.01369 0.00332 

1.110 0.1111 0.01354 0.00329 

1.5 1.037 0.1414 0.03754 0.01204 

1.044 0.1404 0.03721 0.01211 

Table 11: Upper 5 percentage points of -D200,k under H2 

~a\K 1.0 2.5 5.0 10.0 

1.8 1.368 0.1409 0.01818 0.00470 
1.357 0.1398 0.01679 0.00416 

1.5 1.244 0.1695 0.04620 0.01561 

1.249 0.1697 0.04578 0.01514 

Table 12: Power of Dn,K under H2 (a = 1.5, significance levels ^ = 0.1, 0.05, n = 100) 





0.1 


0.05 




1.0 


2.5 


5.0 


10.0 


1.0 


2.5 


5.0 


10.0 


iV(0,2) 


44 


68 


76 


45 


30 


49 


45 


2 




82 


93 


96 


95 


75 


90 


92 


92 


t{2) 


16 


20 


17 


15 


9 


12 


10 


8 


t{3) 


11 


10 


5 


3 


6 


6 


2 


1 


t(4) 


12 


12 


9 


3 


7 


6 


3 





t{5) 


14 


15 


15 


5 


8 


9 


5 





tm 


26 


40 


42 


15 


14 


22 


16 
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Table 13: Power of -Dn,K under H2 (a = 1.5, significance levels ^ = 0.1, 0.05, 





0.1 


0.05 




1.0 


2.5 


5.0 


10.0 


1.0 


2.5 


5.0 


10.0 


A^(0,2) 


79 


99 


100 


100 


63 


96 


99 


98 


t{l) 


98 


99 


100 


100 


96 


99 


100 


100 


t{2) 


24 


26 


21 


16 


16 


18 


14 


10 


t{3) 


12 


10 


10 


9 


6 


6 


5 


3 


t(4) 


16 


24 


32 


27 


8 


12 


16 


8 


t(5) 


23 


40 


50 


47 


12 


26 


33 


19 


/.(lO) 


48 


83 


93 


92 


34 


G6 


84 


G8 



Table 14: Power of -Dn,K under H2 {a = 1.8, significance levels ^ = 0.1, 0.05, 





0.1 


0.05 




1.0 


2.5 


5.0 


10.0 


1.0 


2.5 


5.0 


10.0 


iV(0,2) 


14 


14 


3 





8 


8 


1 





t{l) 


98 


100 


100 


100 


96 


100 


100 


100 


t{2) 


55 


72 


77 


70 


41 


60 


65 


55 


t{3) 


28 


35 


34 


22 


20 


25 


23 


12 


t{A) 


16 


18 


14 


6 


10 


10 


6 


3 


t(5) 


11 


12 


9 


4 


6 


6 


3 


1 


t(10) 


9 


10 


3 





4 


5 


1 






Table 15: Power of under H2 (a = 1.8, significance levels ^ = 0.1, 0.05, 





0.1 


0.05 


K 


1.0 


2.5 


5.0 


10.0 


1.0 


2.5 


5.0 


10.0 


N{0,2) 


20 


23 


18 





10 


12 


4 





t{l) 


99 


100 


100 


100 


97 


99 


100 


100 


t{2) 


83 


94 


94 


89 


72 


88 


89 


80 


m 


45 


54 


49 


33 


28 


37 


37 


21 


t{4) 


24 


25 


18 


7 


13 


15 


9 


4 


t{5) 


16 


17 


9 


3 


7 


9 


4 


1 


t{10) 


13 


12 


9 


1 


5 


6 


2 






30 



